智能论文笔记

Two Is Better Than One: Dual Embeddings for Complementary Product Recommendations

Giorgi Kvernadze , Putu Ayu G. Sudyanti , Nishan Subedi , Mohammad Hajiaghayi

分类：机器学习

2022-11-28

Embedding based product recommendations have gained popularity in recent years due to its ability to easily integrate to large-scale systems and allowing nearest neighbor searches in real-time. The bulk of studies in this area has predominantly been focused on similar item recommendations. Research on complementary item recommendations, on the other hand, still remains considerably under-explored. We define similar items as items that are interchangeable in terms of their utility and complementary items as items that serve different purposes, yet are compatible when used with one another. In this paper, we apply a novel approach to finding complementary items by leveraging dual embedding representations for products. We demonstrate that the notion of relatedness discovered in NLP for skip-gram negative sampling (SGNS) models translates effectively to the concept of complementarity when training item representations using co-purchase data. Since sparsity of purchase data is a major challenge in real-world scenarios, we further augment the model using synthetic samples to extend coverage. This allows the model to provide complementary recommendations for items that do not share co-purchase data by leveraging other abundantly available data modalities such as images, text, clicks etc. We establish the effectiveness of our approach in improving both coverage and quality of recommendations on real world data for a major online retail company. We further show the importance of task specific hyperparameter tuning in training SGNS. Our model is effective yet simple to implement, making it a great candidate for generating complementary item recommendations at any e-commerce website.

translated by 谷歌翻译

Probabilistically Robust PAC Learning

VInod Raman , Unique Subedi , Ambuj Tewari

分类：机器学习 | (统计)机器学习

2022-11-10

Recently, Robey et al. propose a notion of probabilistic robustness, which, at a high-level, requires a classifier to be robust to most but not all perturbations. They show that for certain hypothesis classes where proper learning under worst-case robustness is \textit{not} possible, proper learning under probabilistic robustness \textit{is} possible with sample complexity exponentially smaller than in the worst-case robustness setting. This motivates the question of whether proper learning under probabilistic robustness is always possible. In this paper, we show that this is \textit{not} the case. We exhibit examples of hypothesis classes $\mathcal{H}$ with finite VC dimension that are \textit{not} probabilistically robustly PAC learnable with \textit{any} proper learning rule. However, if we compare the output of the learner to the best hypothesis for a slightly \textit{stronger} level of probabilistic robustness, we show that not only is proper learning \textit{always} possible, but it is possible via empirical risk minimization.

translated by 谷歌翻译

FetReg2021: A Challenge on Placental Vessel Segmentation and Registration in Fetoscopy

Sophia Bano , Alessandro Casella , Francisco Vasconcelos , Abdul Qayyum , Abdesslam Benzinou , Moona Mazher , Fabrice Meriaudeau , Chiara Lena , Ilaria Anita Cintorrino , Gaia Romana De Paolis

分类：人工智能 | 计算机视觉 | 机器学习

2022-06-24

胎儿镜检查激光光凝是一种广泛采用的方法，用于治疗双胞胎输血综合征（TTTS）。该过程涉及光凝病理吻合术以调节双胞胎之间的血液交换。由于观点有限，胎儿镜的可操作性差，可见性差和照明的可变性，因此该程序尤其具有挑战性。这些挑战可能导致手术时间增加和消融不完全。计算机辅助干预措施（CAI）可以通过识别场景中的关键结构并通过视频马赛克来扩展胎儿镜观景领域，从而为外科医生提供决策支持和背景意识。由于缺乏设计，开发和测试CAI算法的高质量数据，该领域的研究受到了阻碍。通过作为MICCAI2021内窥镜视觉挑战组织的胎儿镜胎盘胎盘分割和注册（FETREG2021）挑战，我们发布了第一个Largescale Multencentre TTTS数据集，用于开发广义和可靠的语义分割和视频摩擦质量algorithms。对于这一挑战，我们发布了一个2060张图像的数据集，该数据集是从18个体内TTTS胎儿镜检查程序和18个简短视频剪辑的船只，工具，胎儿和背景类别的像素通道。七个团队参与了这一挑战，他们的模型性能在一个看不见的测试数据集中评估了658个从6个胎儿镜程序和6个短剪辑的图像的图像。这项挑战为创建通用解决方案提供了用于胎儿镜面场景的理解和摩西式解决方案的机会。在本文中，我们介绍了FETREG2021挑战的发现，以及报告TTTS胎儿镜检查中CAI的详细文献综述。通过这一挑战，它的分析和多中心胎儿镜数据的发布，我们为该领域的未来研究提供了基准。

translated by 谷歌翻译

Sim2Real Docs: Domain Randomization for Documents in Natural Scenes using Ray-traced Rendering

Nikhil Maddikunta , Huijun Zhao , Sumit Keswani , Alfy Samuel , Fu-Ming Guo , Nishan Srishankar , Vishwa Pardeshi , Austin Huang

分类：计算机视觉 | 人工智能 | 机器学习

2021-12-16

过去，用于数字化文件的计算机视觉系统可以依赖于系统捕获的高质量扫描。今天，涉及数字文件的交易更有可能从非专业人士拍摄的手机照片上传。因此，文档自动化的计算机愿景现在必须考虑自然场景上下文中捕获的文档。额外的挑战是，文档处理的任务目标可以是高度用例特定的，这使得公共数据集在其实用程序中有限，而手动数据标签也昂贵并且在使用情况之间翻译不当。要解决这些问题，我们创建了SIM2REAL文档 - 一个合成数据集的框架，并在自然场景中执行文档的域随机化。 SIM2REAL文档使使用BLENDER，一个用于3D建模和光线跟踪渲染的开源工具的文档的程序化3D渲染。通过使用渲染来模拟光，几何，相机和背景的物理交互，我们在自然场景上下文中综合文档数据集。每个渲染都与使用案例特定的地面真理数据配对，指定感兴趣的潜在特征，产生无限制的拟合培训数据。然后，机器学习模型的作用是为了解决渲染管道构成的逆问题。通过微调或调整域随机化参数，可以进一步迭代这种模型。

translated by 谷歌翻译

Grounded Relational Inference: Domain Knowledge Driven Explainable Autonomous Driving

Chen Tang , Nishan Srishankar , Sujitha Martin , Masayoshi Tomizuka

分类：人工智能 | 机器学习 | 机器人

2021-02-23

解释性对于自主车辆和其他机器人系统在操作期间与人类和其他物体相互作用至关重要。人类需要了解和预测机器采取的行动，以获得可信赖和安全的合作。在这项工作中，我们的目标是开发一个可解释的模型，可以与人类领域知识和模型的固有因果关系一致地产生解释。特别是，我们专注于自主驾驶，多代理交互建模的基本构建块。我们提出了接地的关系推理（GRI）。它通过推断代理关系的相互作用图来模拟交互式系统的底层动态。我们通过将关系潜空间接地为具有专家域知识定义的语义互动行为来确保语义有意义的交互图。我们展示它可以在模拟和现实世界中建模交互式交通方案，并生成解释车辆行为的语义图。

translated by 谷歌翻译

Automated Human Activity Recognition by Colliding Bodies Optimization-based Optimal Feature Selection with Recurrent Neural Network

Pankaj Khatiwada , Matrika Subedi , Ayan Chatterjee , Martin Wulf Gerdes

分类：机器学习

2020-10-07

在智能医疗保健中，人类活动识别（Har）被认为是传感器读数的普遍计算中的有效模型。家庭或社区中的环境辅助生活（AAL）有助于人民提供独立的护理和增强的生活质量。然而，许多AAL模型使用包括计算成本和系统复杂性的许多因素来限制。此外，由于其应用，HAR概念具有更多相关性。因此，本文旨在使用深度学习来实现来自智能传感器收集的数据，该数据在UC IRVINE机器学习存储库（UCI）中公开提供。所提出的模型涉及三个过程：（1）数据收集，（b）最佳特征选择，（c）识别。从基准存储库收集的数据最初遵循最佳特征选择，有助于选择最重要的功能。所提出的最佳特征选择是基于一种名为碰撞体优化（CBO）的新的元启发式算法。通过识别精度导出的目标函数用于完成最佳特征选择。这里，被称为经常性神经网络（RNN）的深度学习模型用于活动识别。相关基准数据集的提出模型优于现有的学习方法，与传统模型相比提供高性能。

translated by 谷歌翻译